The Meaning of Taflig: Distributional Similarity for Rare Words
نویسنده
چکیده
My area of research is the relationship between a word’s semantics (or meaning) and its syntactic behaviour. The underlying idea is that if two words mean similar things then they will occur in documents in similar contexts. In other words, in a given sentence it is generally possible to substitute each word with a synonym (i.e. another word with the same meaning). For example, in the following sentences we have the unknown word ‘taflig’. I like TAFLIG. TAFLIG osts more than it used to. I drink TAFLIG. Drinking TAFLIG gives me a hangover. There are lots of TAFLIG ans in the re y ling bin.
منابع مشابه
Word meaning in context: a probabilistic model and its application to question answering
The need for assessing similarity in meaning is central to most language technology applications. Distributional methods are robust, unsupervised methods which achieve high performance on this task. These methods measure similarity of word types solely based on patterns of word occurrences in large corpora, following the intuition that similar words occur in similar contexts. As most Natural La...
متن کاملFrom distributional to semantic similarity
Lexical-semantic resources, including thesauri and WORDNET, have been successfully incorporated into a wide range of applications in Natural Language Processing. However they are very difficult and expensive to create and maintain, and their usefulness has been severely hampered by their limited coverage, bias and inconsistency. Automated and semi-automated methods for developing such resources...
متن کاملThe distributional hypothesis∗
Distributional approaches to meaning acquisition utilize distributional properties of linguistic entities as the building blocks of semantics. In doing so, they rely fundamentally on a set of assumptions about the nature of language and meaning referred to as the distributional hypothesis. This hypothesis is often stated in terms like “words which are similar in meaning occur in similar context...
متن کاملInside Out: Two Jointly Predictive Models for Word Representations and Phrase Representations
Distributional hypothesis lies in the root of most existing word representation models by inferring word meaning from its external contexts. However, distributional models cannot handle rare and morphologically complex words very well and fail to identify some finegrained linguistic regularity as they are ignoring the word forms. On the contrary, morphology points out that words are built from ...
متن کاملDistributional Learning of Appearance
Opportunities for associationist learning of word meaning, where a word is heard or read contemperaneously with information being available on its meaning, are considered too infrequent to account for the rate of language acquisition in children. It has been suggested that additional learning could occur in a distributional mode, where information is gleaned from the distributional statistics (...
متن کامل